The Next Generation’s Personal File System Management
نویسندگان
چکیده
The current file systems are hierarchical, which can cause duplicate storage and cannot represent human’s mind map. In this paper, we explore the possibility of a heuristic, relational personal file system. Regarding each file as a node in the graph, we implement K-means, EM, LDA and Tree Bagging algorithms respectively to group the related files. In this way, we convert the current hierarchical file system to relational file system. We compare the results of these algorithms, the error of K-means, EM, LDA and random forest are 62.33%, 61.4%, 42.33% and 16% respectively. Among all the unsupervised learning, LDA a popular and generative model in topic modeling gives the best accuracy, but still does not surpass supervised learning. Therefore we propose to combine LDA and Tree Bagging algorithm, using the semi-supervised learning to classify the files in the future. In the end, we also discussed the potential of combining latent fator model with above methods to classify a large scale of file sets, thus extending our method from personal file system to corporate file system.
منابع مشابه
XML Security in Certificate Management – XML Certificator
The trend of rapid growing use of XML format in data/document management system reveals that security measures should be urgently considered into next generation’s data/document systems. This paper presents a new certificate management system developed on the basis of XML security mechanisms. The system is supported by the theories of XML security as well as Object oriented technology and datab...
متن کاملOutlook for the Next Generation’s Precision Forestry in Finland
During the past decade in forest mapping and monitoring applications, the ability to acquire spatially accurate, 3D remote-sensing information by means of laser scanning, digital stereo imagery and radar imagery has been a major turning point. These 3D data sets that use singleor multi-temporal point clouds enable a wide range of applications when combined with other geoinformation and logging ...
متن کاملXML Security in Certificate Management
Thetrend of rapid growing use of XML format in data/document management system reveals that security measures should be urgently considered into next generation’s data/document systems. This paper presents a new certificate management system developed on the basis of XML security mechanisms. The system is supported by the theories of XML security as well as Object oriented technology and databa...
متن کاملA qualitative study on personal information management (PIM) in clinical and basic sciences faculty members of a medical university in Iran
Background: Personal Information Management (PIM) refers to the tools and activities to save and retrieve personal information for future uses. This study examined the PIM activities of faculty members of Iran University of Medical Sciences (IUMS) regarding their preferred PIM tools and four aspects of acquiring, organizing, storing and retrieving personal information. Methods : The qualita...
متن کاملRequirements for a Next Generation Personal File Manager
Scientists, engineers, knowledge workers and others need help managing their personal data files and the programs that manipulate their data. The current generation of software for supporting their needs, which we call Personal File Managers (PFMs), is not adequate. We propose five requirements that a next generation PFM should satisfy. We have created a mockup of a PFM which satisfies these re...
متن کامل